Chinese Hedge Scope Detection Based on Structure and Semantic Information
نویسندگان
چکیده
Hedge detection aims to distinguish factual and uncertain information, which is important in information extraction. The task of hedge detection contains two subtasks: identifying hedge cues and detecting their linguistic scopes. Hedge scope detection is dependent on syntactic and semantic information. Previous researches usually use lexical and syntactic information and ignore deep semantic information. This paper proposes a novel syntactic and semantic information exploitation method for scope detection. Composite kernel model is employed to capture lexical and syntactic information. Long shortterm memory (LSTM) model is adopted to explore semantic information. Furthermore, we exploit a hybrid system to integrate composite kernel and LSTM model into a unified framework. Experiments on the Chinese Biomedical Hedge Information (CBHI) corpus show that composite kernel model could effectively capture lexical and syntactic information, LSTM model could capture deep semantic information and their combination could further improve the performance of hedge scope detection.
منابع مشابه
Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
Hedge detection is used to distinguish uncertain information from facts, which is of essential importance in biomedical information extraction. The task of hedge detection is often divided into two subtasks: detecting uncertain cues and their linguistic scope. Hedge scope is a sequence of tokens including the hedge cue in a sentence. Previous hedge scope detection methods usually take all token...
متن کاملExploiting Multi-Features to Detect Hedges and their Scope in Biomedical Texts
In this paper, we present a machine learning approach that detects hedge cues and their scope in biomedical texts. Identifying hedged information in texts is a kind of semantic filtering of texts and it is important since it could extract speculative information from factual information. In order to deal with the semantic analysis problem, various evidential features are proposed and integrated...
متن کاملA Cascade Method for Detecting Hedges and their Scope in Natural Language Text
Detecting hedges and their scope in natural language text is very important for information inference. In this paper, we present a system based on a cascade method for the CoNLL-2010 shared task. The system composes of two components: one for detecting hedges and another one for detecting their scope. For detecting hedges, we build a cascade subsystem. Firstly, a conditional random field (CRF) ...
متن کاملExploiting Rich Syntactic Features for Hedge Detection and Scope Finding∗
Hedge detection and scope finding are increasingly important tasks in information extraction, especially in the biomedical natural language processing community. In this paper, a novel approach detecting hedge cues and their scopes by sequence labeling is explored. It should be emphasized that syntactic dependencies are systematically exploited and effectively integrated by a large-scale featur...
متن کاملHedge Detection Using the RelHunter Approach
RelHunter is a Machine Learning based method for the extraction of structured information from text. Here, we apply RelHunter to the Hedge Detection task, proposed as the CoNLL-2010 Shared Task1. RelHunter’s key design idea is to model the target structures as a relation over entities. The method decomposes the original task into three subtasks: (i) Entity Identification; (ii) Candidate Relatio...
متن کامل